Towards cost-effective and high-performance caching middleware for distributed systems

نویسندگان

  • Dongfang Zhao
  • Kan Qiao
  • Ioan Raicu
چکیده

One performance bottleneck of distributed systems lies on the hard disk drive (HDD) whose single read/write head has physical limitations to support concurrent I/Os. Although the solid-state drive (SSD) has been introduced for years, HDDs are still dominant storage due to large capacity and low cost. This paper proposes a caching middleware that manages the underlying heterogeneous storage devices in order to allow distributed file systems to achieve both high performance and low cost. Specifically, we design and implement a user-level caching system that offers SSD-like performance at a cost similar to a HDD. We demonstrate how such a middleware improves the performance of distributed file systems, such as the HDFS. Experimental results show that the caching system delivers up to 7X higher throughput and 76X higher IOPS than Linux Ext4 file system, and accelerates HDFS by 28% on 32 nodes.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

C2P: Co-operative Caching in Distributed Storage Systems

Distributed storage systems (e.g. clustered filesystems HDFS, GPFS and Object Stores Openstack swift ) often partition sequential data across storage systems for performance ( data striping) or protection (Erasure-Coding) . This partitioning leads to logically correlated data being stored on different physical storage devices, which operate autonomously. This un-coordinated operation may lead t...

متن کامل

High Performance Distributed Objects using Caching Proxies for Large Scale Applications

Initial implementations of Middleware based on standards such as CORBA have concentrated on host and language transparency issues in order to demonstrate interoperability. They have largely adopted a No-Replication approach and have frequently neglected performance-at-scale issues. This has lead to a continuing deployment of either non-scalable Full-Replication approaches or ad-hoc messaging-ba...

متن کامل

Access control in ultra-large-scale systems using a data-centric middleware

  The primary characteristic of an Ultra-Large-Scale (ULS) system is ultra-large size on any related dimension. A ULS system is generally considered as a system-of-systems with heterogeneous nodes and autonomous domains. As the size of a system-of-systems grows, and interoperability demand between sub-systems is increased, achieving more scalable and dynamic access control system becomes an im...

متن کامل

Middleware Support for Scalable Services

Services targeted to the home environment must deal with software and hardware heterogeneity of client and server machines. Distributed object based middleware is attractive for building distributed services in such heterogeneous environments for a number of reasons. In particular, distributed objects can hide the complexities that arise from distribution and heterogeneity in the underlying sys...

متن کامل

Optimizing Power using Reconfigurable Middleware

In distributed environments, generic middleware services(e.g. caching, location management etc.) are widely used to satisfy application needs in a cost-effective manner. Such middleware services consume system resources such as storage, computation and communication and can be sources of significant power overheads when executed on low-power devices. Our goal is to develop a distributed middlew...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • IJBDI

دوره 3  شماره 

صفحات  -

تاریخ انتشار 2016